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Abstract 

Many centralized two-sided markets form a matching between par- 
ticipants by running a stable marriage algorithm. It is a well-known 
fact that no matching mechanism based on a stable marriage algo- 
rithm can guarantee truthfulness as a dominant strategy for partic- 
ipants. However, as we will show in this paper, in a probabilistic 
setting where the preference lists of one side of the market are com- 
posed of only a constant (independent of the the size of the market) 
number of entries, each drawn from an arbitrary distribution, the 
number of participants that have more than one stable partner is van- 
ishingly small. This proves (and generalizes) a conjecture of Roth 
and Peranson [23]. As a corollary of this result, we show that, with 
high probability, the truthful strategy is the best response for a given 
player when the other players are truthful. We also analyze equilib- 
ria of the deferred acceptance stable marriage game. We show that 
the game with complete information has an equilibrium in which a 
(1 — o(l)) fraction of the strategies are truthful in expectation. In the 
more realistic setting of a game of incomplete information, we will 
show that the set of truthful strategies form a (l + o(l))-approximate 
Bayesian-Nash equilibrium. Our results have implications in many 
practical settings and were inspired by the work of Roth and Peran- 
son [23] on the National Residency Matching Program. 

1 Introduction 

Suppose all the eligible bachelors and bachelorettes in a town 
confide in the town's matchmaker their ideal spouses. Each 
man submits an ordered preference list of the women he 
would like to marry. Similarly, each woman submits an 
ordered preference list of the men she would like to marry. 
The matchmaker must arrange marriages such that no one is 
tempted to ask for a divorce. In particular, the matchmaker 
must be sure that there is no pair of young lovers who prefer 
each other to their assigned spouses. Such a set of marriages 
is called stable, and finding a set of stable marriages is known 
as the stable marriage problem. Gale and Shapley [6] showed 
that the stable marriage problem always has a solution and 
developed an algorithm to find it. Since the seminal work 
of Gale and Shapley, there has been a significant amount of 
work on the mathematical structure of stable marriages and 
related algorithmic questions. See, for example, the book by 
Knuth [10], the book by Gusfield and Irving [8], or the book 
by Roth and Sotomayoror [24]. 

The stable marriage problem has many promising appli- 
cations in two-sided markets such as job markets [19], college 
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admissions [19], sorority and fraternity rush [14], and assign- 
ment of graduating rabbis to their first congregation [3], Since 
most applications of the stable marriage algorithm involve the 
participation of independent agents, it is natural to investigate 
how we should expect these agents to behave. In particular, 
we would like to know whether agents can benefit by being 
dishonest about their preference lists. Ideally, in economic 
settings such as job markets, we would like to design mech- 
anisms in which truth-telling is a dominant strategy, i.e., it is 
in the best interest of each individual agent to tell the truth, 
no matter what other agents do. We call such a mechanism a 
truthful mechanism. Truthful mechanisms have received sig- 
nificant attention in the computer science community (see, for 
example, [1, 5, 15]). Unfortunately, as shown by Roth [20], 
there is no mechanism for the stable marriage problem in 
which truth-telling is a dominant strategy for both men and 
women. See the book by Roth and Sotomayor [24] for a dis- 
cussion about this problem and other problems related to the 
economic aspects of the stable marriage problem. 

Nonetheless, stable matching algorithms have had spec- 
tacular success in practical applications. One particular job 
market — the medical residency market — has been using a 
centralized stable marriage market system called the National 
Residency Matching Program (NRMP) since the 1950s [21]. 
To this day, most medical residences are formed through an 
updated version of this centralized market system redesigned 
in 1998 by Roth [22]. It seems surprising that an algorithm 
like the one used by the NRMP which provably admits strate- 
gic behavior can be so successful. Roth and Peranson [23] 
noted that, in practice, very few students and hospitals could 
have benefited by submitting false preferences. For exam- 
ple, in 1996, out of 24,749 applicants, just 21 could have 
affected their match by submitting false preferences (assum- 
ing, of course, that no one lied in 1996). Roth and Peran- 
son [23] conjectured that the main reason for this peculiarity 
is the sheer size of the job market. In a small town, every 
man knows every woman, but in the medical market, a stu- 
dent can not possibly interview at every hospital. In practice, 
the length of applicant preference lists is quite small, about 
15, while the number of positions is large, about 20,000. Ex- 
perimentally, Roth and Peranson [23] showed that in a model 
where random preference lists of limited length are generated 
for participants, the number of participants who have more 
than one stable partner (and therefore the number of those 
who can benefit by lying) is small. They conjectured that in 
this probabilistic setting, the fraction of such people tends to 



zero as the size of the market tends to infinity. 

In this paper, we prove a statement that proves and 
generalizes this conjecture. More precisely, we prove the 
following: Assume there are n men and n women in the 
town, and each woman has an arbitrary ordering of all men as 
her preference list. Each man independently picks a random 
preference list of a constant (i.e., independent of n) number of 
women, each according to an arbitrary distribution !2>, These 
are the true preference lists. We show that in this setting the 
expected number of people with more than one stable spouse 
is vanishingly small. This result has a number of interesting 
economic implications. We can interpret the preference lists 
together with a stable marriage algorithm as a game G, in 
which everybody submits a preference list (not necessarily 
their true preference list) to the algorithm and receives a 
spouse. The goal for each player is to receive the best spouse 
possible according to their true preference list. First, we show 
that, with probability 1 — o(l) (as n approaches infinity), 
in any stable marriage mechanism, the truthful strategy is 
the best response for a given player when the other players 
are truthful. We also show that when a deferred acceptance 
mechanism is used, there is an equilibrium of this game in 
which a majority of the players are truthful. Finally, we prove 
that in the more realistic setting of a game of incomplete 
information (where each player only knows the distribution 
of the preference lists), the set of truthful strategies in the 
game induced by the women-proposing mechanism form a 
(1 + o(l))-approximate Bayesian-Nash equilibrium. It is 
important to note that our results hold for any distribution 
@ of women. For the special case of uniform distributions 
(which includes the conjecture of Roth and Peranson), the 
o(l) in the above bounds is roughly e /n, and thus the bounds 
converge quite quickly. 

We use the following technique for our proof: First, 
we design an algorithm, based on an algorithm of Knuth et 
al. [11, 12], that for a given woman checks whether she has 
more than one stable husband in one run of proposals. Using 
this algorithm, we prove a relationship between the probabil- 
ity that a given woman has more than one stable husband and 
the number of single women who are more popular than she. 
This relationship, essential to our main result, seems difficult 
to derive directly, without going through the algorithm. Given 
this relationship, we are able to derive our result by comput- 
ing bounds on the expectation and variance of the number of 
single popular women. 

Related work. This paper is motivated by experimen- 
tal results and a conjecture in the paper by Roth and Per- 
anson [23]. Sethuraman et al. [25, 26] have studied the 
stable matching game when participants are required to an- 
nounce complete preference lists, and have given an optimal 
cheating algorithm and several experimental results regard- 
ing the chances that an agent can benefit by lying in this 
game. One can also view our results as an analysis of sta- 
ble matching with random preferences. There has been a 



considerable amount of work on this area, mostly assuming 
complete preference lists for participants, and none motivated 
by the economic aspects of the problem. See, for example, 
[11, 12, 18, 17]. We will use some of the techniques devel- 
oped in these papers in our analysis. 

Mechanisms that are truthful in a randomized sense (i.e., 
in expectation, or with high probability) have been a subject 
of research in theoretical computer science [1, 2]. These 
mechanisms seeks to encourage truthfulness by introducing 
randomization into the mechanism. Our results are of a 
different flavor. We show that one can conclude statements 
regarding truthfulness by introducing randomization into the 
players utility functions. To the best of our knowledge, our 
result is the first result of this type. 

Structure of the paper. The rest of the paper is orga- 
nized as follows. In Section 2, we define the stable marriage 
problem and discuss the relevant known results from the lit- 
erature. In Section 3, we formalize our probabilistic setting 
and summarize our results. In Section 4, we prove our main 
technical result, the experimental conjecture of Roth and Per- 
anson [23]. In Section 6, we strengthen our technical result 
for the special case of uniformly distributed preference lists. 
Finally, in Section 7, we conclude with interesting open ques- 
tions concerning stable marriage algorithms and two-sided 
markets. 

2 Stable marriage preliminaries 

Consider a community consisting of a set W of n women and 
a set ^ of n men. Each person in this community has a 
preference list, which is a strictly ordered list of a subset of 
the members of the opposite sex. A matching is a mapping [i 
from M U W to „# U W in such a way that for every x e ^#, 
\x{x) e W U {x} and for every x e W, n(x) e J6 U {x}, 
and also for every x,y £ ^ U W , x = fi(y) if and only if 
y = fi(x). If for some to € ^# and w <G W , /i(to) = w, 
we say that w is the wife of to and to is the husband of w 
in [i; or, if for some x e ./# U W , /j,(x) = x, we say that 
x remains single in /1. A pair to € ^£, w € W is called a 
blocking pair for a matching /1, if m prefers w to /z(m), and 
w prefers to to fi(w). A matching with no blocking pair is 
called a stable matching. If a man to and a woman w are a 
couple in some stable matching /j,, we say that to is a stable 
husband of w, and wis a stable wife of to. Naturally, each 
person might have more than one stable partner. In the stable 
marriage problem, the objective is to find a stable matching 
given the preference lists of all men and women. 

The stable marriage problem was first introduced and 
studied by Gale and Shapley [6] in 1962. They proved that a 
stable matching always exists, and a simple algorithm called 
the deferred acceptance procedure can find such a matching. 
This procedure iteratively selects an unmarried man to and 
creates a proposal from him to the next woman on his list. 
If this woman prefers to to her current assignment, then she 
tentatively accepts to's proposal, and rejects the man she 



was previously matched to (if any); otherwise, she rejects 
the proposal of m. The algorithm ends when every man 
either finds a wife that accepts him, or gets rejected by all 
the women on his list, in which case he remains single. This 
algorithm is sometimes called the men-proposing algorithm. 
Similarly, one can define the women-proposing algorithm. 
Gale and Shapley [6] proved the following. 

THEOREM A. The men-proposing algorithm always finds a 
stable matching fi. Furthermore, this stable matching is men- 
optimal, i.e., for every man m and every stable wife w of m 
other than /u(m), m prefers /J,(m) to w. At the same time, fi 
is the worst possible stable matching for women, i.e., for any 
woman w and any stable husband m of w other than n(w), 
w prefers m to /i(u>). 

The men-optimal stable matching is unique, and so the 
above theorem implies that the order of proposals does not 
affect the output of the men-proposing algorithm. 

THEOREM B . The men-proposing algorithm always finds the 
same stable matching, independent of the order in which the 
proposals are made. 

We will also need the following theorem of Roth [21] 
and McVitie and Wilson [13], which says that the choice of 
the stable matching algorithm does not affect the number of 
people who remain unmarried at the end of the algorithm. 

THEOREM C. In all stable matchings, the set of people who 
remain single is the same. 

A stable matching mechanism is an algorithm that elicits 
a preference list from each participant, and outputs a match- 
ing that is stable with respect to the announced preferences. 
We say that truthfulness is a dominant strategy for a partic- 
ipant a if, no matter what strategy other participants use, a 
cannot benefit (i.e., improve his or her match according to 
his or her true preferences) by submitting a list other than 
his or her true preference list. Ideally, we would like to de- 
sign mechanisms in which truthfulness is a dominant strategy 
for all participants. However, Roth [20] proved that there is 
no such mechanism for the stable marriage problem. On the 
positive side, the following theorem (due to Roth [20] and 
Dubins and Freedman [4]) shows that in deferred acceptance 
mechanisms, truthfulness is a dominant strategy for half the 
population. 

THEOREM D. In the men-optimal stable marriage mecha- 
nism, truth-telling is a dominant strategy for men. Similarly, 
in the women-optimal mechanism, truth-telling is a dominant 
strategy for women. 

3 Our results 

Consider a situation where there are n men and n women. 
Assume the preference list of each man is chosen indepen- 
dently and uniformly at random from the set of all ordered 



lists of k women, and the preference list of each woman is 
picked independently and uniformly at random from the set 
of all orderings of all men. We are concerned about bounding 
the expected number of people who might be tempted to lie to 
the mechanism about their preferences when the other play- 
ers are truthful. As we will show, only people who have more 
than one stable partner might be able to influence their final 
match by altering their preference lists. Therefore, we focus 
on bounding the expected number of women with more than 
one stable husband in this model. Notice that this number 
is equal to the expected number of men with more than one 
stable wife, since the number of single and uniquely matched 
men must equal the number of single and uniquely matched 
women. Roth and Peranson [23] conjectured the following. 

CONJECTURE 1. Let Cfc(n) denote the expected number of 
women who have more than one stable husband in the above 
model. Then for all fixed k, 

lim ^M = 0. 

n^oo n 

We prove this conjecture in this paper. In fact, we will prove 
the following stronger result. Let Q) be an arbitrary fixed 
distribution over the set of women such that the probability 
of each woman in £> is nonzero. 1 Intuitively, having a 
high probability in @ indicates that a woman is popular. 
The preference lists are constructed by picking each entry 
of the list according to '2, and removing the repetitions. 
More precisely, we construct a random list (l\, . . . , Ik) of k 
women as follows. At step i, repeatedly select women w 
independently according to Si until w ^ {l\, . . . , i,_i} and 
then set li = w. Let $> k be the distribution over lists of size 
k produced by this process. 2 Notice that if £F is the uniform 
distribution, 3> k is nothing but the uniform distribution over 
the set of all lists of size k of women. Therefore, the model 
of Roth and Peranson [23] is a special case of our model. We 
also generalize their result in another respect: we assume that 
women have arbitrary complete preference lists, as opposed 
to the assumption in [23] that they have random complete 
preference lists. Our main result is the following theorem. 

THEOREM 3.1. Consider a situation where each woman has 
an arbitrary complete preference list, and each man has a 
preference list chosen independently at random according to 
Q) . Then, for all fixed k, 

lim ^M = 0. 

n^oo n 

Even though we state and prove our results assuming that 
all preference lists are of size exactly k, it is straightforward 
to see that our proof carries over to the case where preference 
lists are of size at most k. For uniform distributions, we can 
prove a strong result on the rate of convergence of this limit. 
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THEOREM 3.2. Consider a situation where each woman has 
an arbitrary complete preference list, and each man has a 
preference list of k women chosen uniformly and indepen- 
dently. Then, the expected number of women who have more 
than one stable husband is bounded by e k+1 + k 2 , a constant 
that only depends on k (and not on n). 

There are a number of interesting economic implications 
of this theorem. Our first result states that, with high proba- 
bility, a given player's best strategy is truth-telling when the 
other players are truthful. Thus, a dishonest player who be- 
lieves in the honesty of the other players has an economic 
incentive to be honest. This is similar (but not identical) to 
the notion of ex post Nash equilibria. 

COROLLARY 3.1. Fix any stable matching mechanism, and 
consider an instance with n women with arbitrary complete 
preference lists and n men with preference lists drawn from 
'S (as in Theorem 3.1). Then, for any given person x, the 
probability (over the men 's preference lists) that for x, the 
truthful strategy is not the best response in a situation where 
the other players are truthful is o(l) (at most 0(e k /n) for 
uniform distributions). 

The previous corollary states that a player can benefit 
by lying only with a vanishingly small probability when the 
other players are truthful. Now we turn to the situation 
in which the other players are not necessarily truthful, but 
are playing an equilibrium strategy of the game induced by 
the stable matching mechanism. There are two ways to 
interpret our stable marriage setting as a game. One way 
is to consider it as a game of complete information: Let 
P m and P w denote the preference lists of men and women. 
Knowing these preferences, each player chooses a strategy 
from the strategy space of all possible preference lists. The 
corresponding preference lists are submitted to a fixed stable 
marriage algorithm and a matching is returned. A player's 
goal is to choose the strategy that gets him/her a spouse 
as high on his/her preference list as possible. Let Gp mt p vj 
denote this game. An equilibrium of a game is a set of 
strategies, one for each player, such that no single player 
can improve his/her situation by deviating from his or her 
equilibrium strategy [16]. 

COROLLARY 3.2. Assume the preference lists P w of women 
are arbitrary, and the preference lists P m of men are drawn 
from & k (as in Theorem 3.1). The game Gp m ,p uj induced 
by these preferences and the men-proposing (or women- 
proposing) mechanism has an equilibrium in which, in ex- 
pectation, a (1 — o(l)) fraction of strategies are truthful. 

In the above setting, we assume that each player knows 
the preference lists of the other players when he/she is select- 
ing a strategy, i.e., we have a game of complete information. 
A more realistic assumption is that each player only knows 
the distribution of preference lists of the other players. Each 



player's goal is to alter his/her preference list and announce it 
to the mechanism in a way that the expected rank of his/her 
assigned spouse is as high as possible. A strategy for a player 
is a function that outputs an announced preference list for any 
input preference list. Hence the truthful strategy is the identity 
function. We wish to analyze the Bayesian-Nash equilibria in 
this incomplete information game. A (1 + e) -approximate 
Bayesian-Nash equilibrium for this game is a collection of 
strategies, one for each player, such that no single player can 
improve his/her situation by more than a multiplicative factor 
of 1 + e by deviating from his/her equilibrium strategy. 

COROLLARY 3.3. Consider the game described above with 
the women-optimal mechanism. Then for every e > 0, if n 
is large enough, the above game has a (1 + e) -approximate 
Nash equilibrium in which everybody is truthful. 

The proofs of these corollaries are presented in Section 5. 

4 Proof of Theorem 3.1 

In this section, we will prove our main technical result, 
Theorem 3.1. The proof consists of three main components. 
First, we present an algorithm that, given the preference lists, 
counts the number of stable husbands of a given woman 
(Section 4.1). We would like to analyze the probability 
that the output of this algorithm is more than one, over 
a distribution of inputs. In Section 4.2, we bound this 
probability assuming a lemma concerning the number of 
singles in a stable marriage. This lemma is proved in 
Section 4.3 by bounding the expectation of the number of 
singles and proving that it is concentrated around its expected 
value using the Chebyschev inequality. 

4.1 Counting the number of stable husbands The sim- 
plest way to check whether a woman g has more than one 
stable husband or not is to compute the men-optimal and the 
women-optimal stable matchings using the algorithm of Gale 
and Shapley (See Theorem A) and then check if g has the 
same husband in both these matchings. However, analyzing 
the probability that g has more than one stable husband us- 
ing this algorithm is not easy, since we will not be able to 
use the principle of deferred decisions (as described later in 
Section 4.2). In this section we present a different algorithm 
that outputs all stable husbands of a given woman in an ar- 
bitrary stable marriage problem in one run of a men-propose 
algorithm. This algorithm is a generalization of the algorithm 
of Knuth et al. [11, 12] to the case of incomplete preference 
lists. 

Suppose we want the stable husbands of woman g. 
Initially all the people are unmarried (the matching is empty). 
The algorithm closely follows the man-proposing algorithm 
for finding a stable matching. However, g's, objective is 
to explore all her options, therefore, every time the men- 
proposing algorithm finds a stable marriage, g divorces her 
husband and lets the algorithm continue. 



Algorithm 4.1. 

1. Initialization: Run the man-proposing algorithm to find 
the men-optimal stable matching. If g is unmarried, 
output 0. 

2. Selection of the suitor: Output the husband m of g as one 
of her stable husbands. Remove the pair (to, g) from the 
matching (woman g and man to are now unmarried) and 
set b = 77i. (the variable b is the current proposing man.) 

3. Selection of the courted: If b has already proposed to all 
the women on his preference list, terminate. Otherwise, 
let w be his favorite woman among those he hasn't 
proposed to yet. 

4. The courtship: 

(a) If w has received a proposal from a man she 
likes better than b, she rejects b and the algorithm 
continues at the third step. 

(b) If not, w accepts b. If w = g, the algorithm 
continues at the second step. Otherwise, if w was 
previously married, her previous husband becomes 
the suitor b and the algorithm continues at the third 
step. If w was previously unmarried, terminate the 
algorithm. 

Notice that in step 4(a) of the algorithm, w compares b 
to the best man who has proposed to her so far, and not to the 
man she is currently matched to. Therefore, after g divorces 
one of her stable husbands, she has a higher standard, and will 
not accept any man worse than the man she has divorced. For 
w 7^ g, step 4(a) is equivalent to comparing b to the man w is 
matched to at the moment. 

We must prove that this algorithm outputs all stable 
husbands of g. In fact, we will prove something slightly 
stronger. 

THEOREM 4.1. Algorithm 4.1 outputs all stable husbands of 
g in order of her preference from her worst stable husband to 
her best stable husband. 

Proof. We prove the theorem by induction. As the man- 
proposing algorithm returns the worst possible matching for 
the women (by Theorem A), the first output is g's worst stable 
husband. Now suppose the i'th output is g's i'th worst stable 
husband m,. Consider running the man-proposing algorithm 
with g's preference list truncated just before man to^. As the 
order of proposals do not affect the outcome (Theorem B), let 
the order of proposals be the same as Algorithm 4. 1 . Then, 
up until Algorithm 4.1 outputs the i + l'st output mj+i, its 
tentative matching during the j'th proposal is the same as the 
tentative matching of the man-proposing algorithm during the 
j'th proposal (except, possibly, woman g is matched in Al- 
gorithm 4. 1 and unmatched in the man-proposing algorithm). 
Now since m i+1 was accepted, the fourth step guarantees that 



g preferred m i+1 to TOj. Thus m i+1 is on g's truncated prefer- 
ence list, and so the tentative matchings of the two algorithms 
are the same. Furthermore, TOj+i is the first proposal g has ac- 
cepted in the man-proposing algorithm. All other women who 
get married in the set of stable matchings already have hus- 
bands since they have husbands in Algorithm 4.1, and so the 
man-proposing algorithm terminates with the current match- 
ing. Thus, m,+i is the worst possible stable husband for g 
which is better than rrii. 

4.2 Analyzing the expectation We are interested in the ex- 
pected number of women with more than one stable husband, 
or, equivalently, the probability that a fixed woman g has more 
than one stable husband. We can compute this probability by 
analyzing the output of Algorithm 4. 1 on male preference lists 
drawn from the distribution 3> k . We simulate this experiment 
using the principle of deferred decisions: a man only needs 
to determine his i'th favorite woman when he makes his i'th 
proposal. If we make these deferred decisions independently 
according to S>, then the distribution of the output of this new 
algorithm over its coin flips will be exactly the same as the 
distribution of the output of the old algorithm over its input. 
This motivates the definition of the following algorithm. At 
any point in this algorithm, the variable Ai denotes the set of 
women that man i has proposed to so far. Men and women 
are indexed by numbers between 1 and 77. 



Algorithm 4.2. 
1. Initialization: Let I 



1, V 1 < i < n, Ai 



x g = 0. (The matching is empty and no men have made 
any proposals). 

2. Selection of the suitor: 

(a) If I < 77, let b be the Tth man and increase I by one. 

(b) Otherwise, we have found a stable matching. If 
g is single in this stable matching, then terminate. 
Otherwise, increment x g , remove the pair (to, g) 
from the matching (man 777 and woman g who 
were previously married to each other are now 
unmarried) and set b = to. 

3. Selection of the courted: 

(a) If \Ab\ > k, then do the following: 

- If x g > 1, then terminate. Otherwise, return 
to step two. 

(b) Repeatedly select w randomly according to distri- 
bution Q) from the set of all women until w £ A^,. 
Add w to A b . 

4. The courtship: 

(a) If w has received a proposal from a man she 
likes better than 6, she rejects b and the algorithm 
continues at step 3. 



(b) If not, w accepts b. If w was previously married, 
her previous husband becomes the suitor b and 
the algorithm continues at the third step. If w 
was previously single and x g — 0, the algorithm 
continues at the second step. If w was previously 
single and x g > 1, the algorithm continues at the 
second step if w — g and terminates if w ^ g. 

Before giving a proof of Theorem 3.1, we introduce a few 
notations. For every woman i, let pi denote the probability of 
i in the distribution '2. We say that a woman i is more popular 
than another woman j, if pi > pj . Assume, without loss of 
generality, that women are ordered in the decreasing order of 
popularity, i.e., pi > p 2 > ■ ■ ■ > p n . 

Proof of Theorem 3.1. Recall that Ck(n) is the ex- 

pected number of women with more than one stable hus- 
band. We show that for every e > 0, if n is large 
enough, then Ck(n)/n < e. By linearity of expectation, 
Cfc(n) = 5Z e;r Pr[g has more than one stable husband]. 
Fix a woman g £ W . We want to bound the probability that 
g has more than one stable husband. By Theorem 4. 1 and the 
principle of deferred decisions, this is the same as bounding 
the probability that the random variable x g in Algorithm 4.2 
is more than one. 

We divide the execution of Algorithm 4.2 into two 
phases: the first phase is from the beginning of the algorithm 
until it finds the first stable matching, and the second phase is 
from that point until the algorithm terminates. Assume at the 
end of the first phase, Algorithm 4.2 has found the first stable 
matching /i. We bound the probability that x g > 1 condi- 
tioned on the event that /i is the matching found at the end of 
the first phase (we denote this by Pr[x g > 1 | /j]), and then 
take the expectation of this bound over /i. 

Let the set S^ (g) denote the set of women more popular 
than g that remain single in the stable matching \i and 
x »(g) = \S^{g)\. If 9 is single in \i, then x g = 
and therefore Pr[a; s > 1 | /i] = 0. Otherwise, x g > 
1 if only if woman g accepts another proposal before the 
algorithm terminates. We bound this by the probability that 
g receives another proposal before the end of the algorithm. 
The algorithm may terminate in several ways, but we will 
concentrate on the termination condition in step 4(b), i.e., 
that some man proposes to a previously single woman. Thus, 
we are interested in the probability that in the second phase 
of Algorithm 4.2 some man proposes to a previously single 
woman before any man proposes to g. 

Note that at the end of the first phase of the algorithm, 
all Aj's are disjoint from S^(g), since women have complete 
preference lists. Thus whenever the random oracle in step 
3(b) outputs a woman from set S^g), the algorithm will 
advance to step 4(b) and terminate. Thus, the probability 
Pr[x g > 1 | /i] is less than or equal to the probability that 
in a sequence whose elements are independently picked from 
the distribution '3, g appears before any woman in S^(g). 
By the definition of S^(g), for every w <G S fl (g), every time 



we pick a woman randomly according to S>, the probability 
that w is picked is at least as large as the probability that g 
is picked. Therefore, the probability that g appears before all 
elements of S^ (g) in a sequence whose elements are picked 
according to S 1 is at most the probability the g appears first in 
a random permutation on the elements of {g} U S fi (g), which 
is l/(X ll (g) + 1). Thus, for every [i, 

(4.1) P ^>1|M]<^ 



Thus, 
(4.2) 


Pr[x s > 1] = E p 


Pr[x g > 1 | n) 
1 


[x^g) + l\ 





We complete the proof assuming the following lemma, 
whose proof is given in Section 4.3. 



Lemma 4.1. For every g > Ak, 

E[l/(Ay.g) + !)]< 



g 



Thus, using Equation (4.2) and Lemma 4. 1 for g > j^t, 
and Pr[a; s > 1] < 1 for smaller g's, we obtain 



CfcO) < 



I6nk 
ln(n) 



E 



I2 e 8nk /s 



-ln(n) 



16nfc >A 31n(n)e ln <")/ 2 
lnfn) ^— J Ank 

9-EW 

< r^T + iVn\n{n)/{Ak) = o(n), 
m(n) 

and so for every constant k, the fraction of women with more 
than one stable husband, Ck(n)/n, goes to zero as n tends to 
infinity. □ 

For the case of uniform distributions, it is possible to 
modify the above proof to get a much tighter bound of 0(e sk ) 
on the expected number of women with more than one stable 
husband. We derive an even tighter bound in this case, as 
stated in Theorem 3.2, using a slightly different technique. 
This bound is proved in Section 6. 

4.3 Number of singles In this section we prove 
Lemma 4.1. This completes the proof of Theorem 3.1. 
We start with the following simple fact: the probability that 
a woman w remains single is greater than or equal to the 
probability that w does not appear on the preference list 
of any man. More precisely, let E w denote the event that 
the woman w does not appear on the preference list of any 
man when these preferences are drawn from S> k . Let Y g 
denote the number of women w < g for which the event E w 
happens. Then we have the following lemma. 



LEMMA 4.2. For every g, we always have X /1 (g) > Y g ? 

Proof. Every woman w < g for which E w happens is a 
woman who is at least as popular as g and will remain 
unmarried in any stable matching. □ 

We now bound the expectation of l/(Y g + 1) in a 
sequence of two lemmas. In Lemma 4.3 we bound the 
expectation of Y g . Then, in Lemma 4.4 we show the variance 
of Y g is small and therefore it does not deviate far from its 
mean. 

LEMMA 4.3. For g > Ak, the expected number E[X fJ- (g)] 
of single women more popular than woman g is at least 

g p -8nk/g 



Proof Let Q = J2j=iPj denote the total probability of the 
first k women under 2. The probability that a man m does 
not list a woman w as his z'th preference given that he picks 
wi, . . . , Wj_i as his first i — 1 women, is equal to 



1 



Pa 



1 - ELl Pv 



> 1 



pit 



1-Q 



Thus the probability that m does not list w at all is at least 



(1- 



c , and so the probability that woman w is not listed 



Let M be an arbitrarily large constant. The following 
process is one way to simulate the selection of one prefer- 
ence list L = (l\, ... ,1k): Consider the multiset £ consisting 
of |_PjMJ copies of the name of woman i. Pick a random 
permutation tt of S. Let li be the i'th distinct name in n. 
It is not hard to see that as M — > oo, the probability of a 
given list L in this process converges to its probability un- 
der distribution 2) k . Therefore, Pr[Fi] is equal to the limit as 
M — > oo of the probability that k distinct names occur be- 
fore i in 7r. Similarly, if X' denotes the multiset obtained by 
removing all copies of woman j from S, then Pr[Fj|.Fj] is 
equal to the limit as M — > oo of the probability that k distinct 
names occur before i in a random permutation of £'. How- 
ever, this is precisely equal to the probability that k distinct 
names other than j occur before i in a random permutation n 
of S. But that certainly implies that k distinct names (includ- 
ing j) occur before i in tt, and so for every tt where F^Fj hap- 
pens, Fi also happens. Therefore, Pr[i 7 i|F :) ] < Pr[Fi]. Thus, 
Pr[£i A Ej] < Pi[Ei].Pr[Ej], and so the variance a 2 (Y g ) is 

a\Y g ) = E[r 9 2 ]-E[y s ] 2 

9 

-^Pr[^] 2 -2^Pr[^].Pr[^] 

i—1 1<*<J<9 



< ^Pr[^] = E[y s ; 



by any man is at least (1 — j^rh)"' - If w > k, there are at 
least w — k women who are at least as popular as w, but not 
among the k most popular women. Therefore, p w < ^jE^- 
By these two inequalities, for every w > 2k we have 

Pr[£j > (1 — ) nk > e - 2nk/{w - k) > e - Ank/w 

w — k 

„. „ „ . , . . ,,, . , Proof of Lemma 4. 1. Let q be the probability that Y„ < 

Therefore, for every q > Ak, the expectation of Y„ is at least „»,, ,„„,„, , . ,. , , . . 

y E[Y g ]/2. By the Chebyshev inequality and Lemma 4.4, 



□ 
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q < Pr 



|r s -E[F g ]|>E[F g ]/2 
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*HY g ) 
(E[Y g }/2) 2 ~ E[Y g y 



< 



D Therefore, by Lemma 4.2 and the fact that l/(Y g 
always at most one, we have 



1) is 



E 



Ws) + i 



< E 



Y„ + l 



< (1-9) 



E[y fl ]/2 + i 



< 



LEMMA 4.4. The variance a (Y g ) of the random variable 
Y g is at most its expectation E\Y g ]. 

Proof. We show the events Ei are negatively correlated, i.e., 

for every i and j, Pr[E, A Ef\ < Pr^.Pr^], Let F t be 

the event that a given man does not include woman i on his 

preference list. By the independence and symmetry of the 

preference lists of men, we have Pr[_Ei] = (Pr[i 7 i]) n , and 

Pr[.Ej A Ej] = (Pr[Fi A Fj]) n . Therefore, it is enough to which together with Lemma 4.3 completes the proof. 

show that for every i and j, Pr[Fi\Fj] < Pr[Fj]. 

5 Proofs of economic implications 
^Slnore mathematical terms, this means that X„(g) stochastically In this section, we prove the corollaries stated in Section 3. 
dominates Y g . The first of these results argues that a dishonest player has 



WgY 



□ 



economic incentives to be honest when other players are 
honest. 

Proof of Corollary 3.1. Fix a person, say a man named 
Adam and suppose all other players are truthful. Theorem 3.1 
implies that with probability at least 1 — o(l), Adam has at 
most one stable wife, Eve, with respect to the true preference 
lists of the players. Suppose all other players are truthful. 
We claim Adam's best response is truth-telling. Suppose 
not. Allow Adam to play his best response p and let p be 
the matching that the stable matching mechanism outputs. 
Now run the men-optimal algorithm with the same preference 
lists (i.e., p for Adam, and true preference lists for others) 
and let hm be the resulting matching. By Theorem A, 
Adam must prefer his match in /um to his match in p. 
However, by Theorem D, in the men-proposing algorithm, 
Adam's dominant strategy is truth-telling and, by assumption, 
matches him to Eve. Therefore, Adam must prefer Eve to 
his match in pu and thus to his match in p. But Eve is the 
woman that Adam would have been matched to in the original 
mechanism if he had been truthful (since it was his unique 
stable match), and so his altered strategy p was not his best 
response. □ 

The second corollary shows that the game Gp m _p w de- 
fined in Section 3 has an equilibrium in which in expectation 
a (1 — o(l)) fraction of participants are truthful. 

Proof of Corollary 3.2. Suppose we are using the men- 
proposing mechanism (the women-proposing situation is 
analogous). We prove that the following set of strategies 
forms an equilibrium in the game Gp mt p m : all men announce 
their true preferences; all women who have at most one stable 
husband (with respect to P m , P w ) announce their true prefer- 
ences; and all women who have more than one stable husband 
truncate their preference lists just after their optimal stable 
husband. We denote the altered preference lists of women 
by P' w . By Theorem D, men cannot improve their situation 
by altering their strategy. Consider a woman, say Alice, and 
assume Alice will be assigned to Bob if the players use the 
strategies in (P m , P w )- It i s eas Y to see that there is a unique 
stable matching with respect to (P m , P w )- Therefore, if we 
run the women-optimal mechanism on (P m , P w ), we get the 
same outcome as in the men-optimal mechanism. However, 
by Theorem D we know that no woman can benefit from alter- 
ing her preferences in a women-optimal mechanism. Thus, if 
Alice changes her strategy from the one dictated by P w , then 
she gets a match, say Tom, that according to P w is not better 
than Bob. However, by the definition of P w , this implies that 
Tom is not better than Bob according to the true preferences 
of Alice. This shows that the set of strategies (P m ,P w ) is 
an equilibrium. By Theorem 3.1, we know that all men and 
all but at most a o(l) fraction of women are truthful in this 
equilibrium. □ 

Finally, we prove the third corollary, which says that if 
we model the situation as a game of incomplete information, 



then everybody being truthful is an approximate Bayesian 
Nash equilibrium. 

Proof of Corollary 3.3. Since the women-optimal mecha- 
nism is used, we know by Theorem D that truthfulness is a 
dominant strategy for women. It is enough to show that if 
all men and women are truthful, then no man can improve 
his match by more than a (1 + e) factor if he uses a dishonest 
strategy. Fix a man, Charlie. With probability 1— o(l), prefer- 
ences are such that Charlie does not have more than one stable 
wife. In this case, the argument used in the proof of the pre- 
vious two corollaries shows that Charlie cannot gain by being 
dishonest about his preferences. With probability o(l), Char- 
lie has more than one stable wife, and in that case, he might 
be able to improve his match from someone ranked at most k 
in his list to someone ranked first. However, k is a constant. 
Using this, it is easy to verify that on average, he can improve 
his match by at most a factor of 1 + k 2 x o(l) = 1 + o(l). 
Thus, everyone being truthful is an approximate equilibrium 
in this game. □ 

6 Tighter analysis for the uniform distribution 

For the case of uniform distributions, it is possible to derive a 
much tighter bound on the expected number of women with 
more than one stable husband. 

Recall that in the proof of Theorem 3.1, we bounded the 
probability that a fixed woman g is single by E M [l/(X M (g) + 
1)], where X^(g) is the number of women at least as popular 
as g that are single in matching p. In the case of the uniform 
distribution, for every woman g, X^ (g) is equal to the number 
of singles in p. Therefore, if we define the random variable X 
as the number of women who remain unmarried in the men- 
optimal stable matching (recall that by Theorem C, the set of 
unmarried women is independent of the choice of the stable 
marriage algorithm), then we have 



c k( n ) < *lE 



1 



X+l 



Thus, the following lemma shows that if men have 
random preference lists of size k, then the expected number 
of women who have more than one stable partner is at most 



pfc+i 



k . This completes the proof of Theorem 3.2. 



LEMMA 6.1. Let X denote the random variable defined 
above. Then, 



E 



1 



IxTT^ 



3 fe+l 



The proof of the above lemma is based on a connection be- 
tween the stable marriage problem and the classical occu- 
pancy problem. In the occupancy problem, m balls are dis- 
tributed amongst n bins. The distribution of the number of 
balls that end up in each bin has been studied extensively 
from the perspective of probability theory [9]. We denote the 



occupancy problem with m balls and n bins by the (m, n)- 
occupancy problem. The following lemma establishes the 
connection between the number of singles in the stable mar- 
riage game and the number of empty bins in the occupancy 
problem. 

LEMMA 6.2. Let Y m ^ n denote the number of empty bins in 
the (m,n) -occupancy problem and X denote the random 
variable in Lemma 6.1. Then, 

1 -. r 1 -, k 2 

<E 1+ — . 

1 J " [ Y, 



E 



L X 



1 



r (fe+l)n,n 

Proof. We use the techniques of amnesia, the principle of de- 
ferred decisions, and the principle of negligible perturbations 
used by Knuth [10] and Knuth, Motwani, and Pittel [11, 12]. 
Assume every woman has an arbitrary ordering of all men. 
We define the following five random experiments: 

• Experiment 1 is the experiment defined before 
Lemma 6. 1 : every man chooses a random list of at most 
k different women as his preference list. Then, we run 
the men-proposing stable marriage algorithm, and let the 
random variable X\ = X indicates the number of single 
women at the end of this experiment. Notice that in this 
experiment, men do not have to select their entire pref- 
erence list before running the algorithm. Instead, every 
time a man has to propose to the next woman on his list, 
he chooses a random woman among the women he has 
not proposed to so far, and proposes to that woman. It 
is clear that this does not change the experiment. This is 
called the principle of deferred decisions. 

• In Experiment 2, each man names k different women 
at random. We let X 2 be the number of women that no 
man names in this game. 

• Experiment 3 is the same as experiment 2, except here 
the men are amnesiacs. That is, every time a man wants 
to name a woman, he picks a woman at random from the 
set of all women. Therefore, there is a chance that he 
names a woman that he has already named. However, 
each man stops as soon as he names k different women. 
Let X3 be the number of women who are not named in 
this process. 

• In Experiment 4, we restrict every man to name at most 
k + 1 women. Therefore, each man stops as soon as 
either he names k different women, or when he names 
k + 1 women in total (counting repetitions). Let X4 
denote the number of women who are not named in this 
experiment. 

• In Experiment 5 every man names exactly k + 1 (not 
necessarily different) women. The number of women 
who are not named in this experiment is denoted by X$. 



Now, we show how the random variables X\ through X 5 
are related. It is easy to see that for any set of men's prefer- 
ence lists, the number of unmarried women in Experiment 1 
is at least the number of women who are not named in Ex- 
periment 2. Therefore, X\ > X 2 . Also, it is clear from the 
description of Experiments 2 and 3 that X 2 = X 3 . 

In order to relate X3 and X4, we use the principle of 
negligible perturbations. Experiments 4 is essentially the 
same as Experiment 3, except in X4 we only count women 
who are not named by any man as one of his first k + 1 
choices. Let E denote the event that no man names more 
than k + 1 women in Experiment 3. We first show that 
Pr[_E] < k 2 /n. Fix a man, say Homer. We want to bound the 
probability that Homer names at least k + 2 women before 
the number of different women he has named reaches k. By 
the union bound, this probability is at most the sum, over 
all pairs {i,j} C {1, . . . , k + 2} that both i'th and j'th 
proposal of Homer are redundant. It is easy to see that for 
any such pair, this probability is at most 1/n 2 . Therefore, 
the probability that Homer makes more than k + 1 proposals 
is at most ( J )/n 2 < k 2 /n 2 . Thus, by the union bound, 
the probability of this happens for at least one man is less 
than k 2 /n. That is, Pr[E] < k 2 /n. Now, notice that 
by the definition of X3 and X4, the random variables X3 
and X4 are equal when conditioned on the occurrence of 
E. Therefore, E [^\E] = E [^\E]. Let C = 

be the unconditioned difference of 



these expectations. Then 
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where q = Pr[E] and q — Pr[.E]. Finally, we observe that 
by the definition of Experiments 4 and 5, we have X4 > X5. 
The above observations imply 

= E[ 



n 



< E[ 

< E[ 

Y(k+l)n,n + 1 

This completes the proof of the lemma. 
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Clearly, X 5 = Y { 
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By the above lemma, the only thing we need to do 
is to analyze the expected value of l/(Y m ^ n + 1) in the 



occupancy problem. We do this by writing the expected value 
of l/(Y„ lyn + l) as a summation and bounding this summation 
by comparing it term-by-term to another summation whose 
value is known. 

LEMMA 6.3. Let Y m _ n denote the number of empty bins in 
the (m,n)-occupancy problem. Then, 

1 p-m/n 

[ " ]<" • 
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Proof. Let P r (m, n) be the probability that exactly r bins are 
empty in the (to, n)-occupancy problem. Then Po(m, n), the 
probability of no empty bin, can be written as the following 
summation by the principle of inclusion-exclusion. 4 



(6.3) 



Bo(m,n) = J2(-l) 



n d--r 

l I n 



The probability P r (m,n) of exactly r empty bins can be 
written in term of the probability of no empty bin in the 
(to, n — r) -occupancy problem: 



The value of S can be bounded easily using a combinatorial 
argument. 

Consider the (to, n + l)-occupancy problem. The prob- 
ability that at least one bin is empty is the sum, over r = 
1, . . . , n + 1, of P r (m, n + 1). We denote this probability by 
S. By Equation (6.5) we have 



r=l 8=0 



n + 1' 



where the inequality follows from the fact that S is the prob- 
ability of an event. The summation in Equation (6.6) and 
S have the same number of terms, and the ratio of each 
term in the summation in Equation (6.6) to the correspond- 
ing term in S is equal to (1 - ' £± ^ 1 ) m /{l - ^\) m = 

{ n—r—i+1 \ m I ( n+l — r—i \m {'\ _l_ 1 \m 



n+1 
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Lemma 6.1 immediately follows from Lemmas 6.2 and 6.3. 
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(6.4) P r (m,n)=[ (1 - -) m P (m, n - r). 
v r / n 



By Equations (6.3) and (6.4), 



(6.5) P r (m, n) - ^(-l) 4 . (1 - — ) m , 



where ( ", ) denotes the multinomial coefficient ,,.,"• — rvr . 

\a.b/ al o! {n—a—b)\ 

Using Equation (6.5) and the definition of expected value we 
have, using the notation E — E[ Y +1 ], n' = n + 1, and 
r' = r + 1, 
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It is probably impossible to simplify the above summation as 
a closed-form formula. Therefore, we use the following trick: 
we consider another summation S with the same number of 
terms, and bound the ratio between the corresponding terms 
in these two summations. This gives us a bound on the 
ratio of the summation in Equation (6.6) to the summation 5. 

4 This can also be derived by dividing a well-known formula for Stirling 
numbers of the second kind (see, for example, [7, 27]) by n m . 



7 Conclusion 

In this paper we studied the stable marriage game in a 
probabilistic setting and showed that dishonesty almost surely 
does not benefit a player. This answers a question asked 
by Roth and Peranson [23], and generalizes their model to 
one where women have arbitrary preferences and each man 
independently picks a preference list from 3> k , where 2) is an 
arbitrary distribution. One might hope to further generalize 
this model to one where each man picks a random list from 
an arbitrary distribution over lists of size k. However, the 
following example shows that Theorem 3.1 is not true in this 
model: Assume women 1 , . . . , n/2 rank men in the order 
1,2, ... ,n, and women n/2 + 1, . . . ,n rank them in the 
reverse order. Each man picks a random i <G {1, . . . , n/2}, 
and with probability 1/2 picks preference list (i, i + n/2) and 
otherwise picks preference list (i + n/2, i). It is not difficult 
to see that with these preferences, at least a l/(8e) fraction of 
women will have more than one stable husband. 

There are many other interesting open questions sur- 
rounding the application of the stable matching algorithm in 
centralized markets. For example, in the NRMP market, the 
algorithm has to accommodate couples among students who 
want to live in the same city. Such couples can submit a joint 
preference list of pairs of hospitals, and the algorithm has to 
match them to one of the pairs in their list. With this extra 
twist, there are instances for which no stable matching ex- 
ist. However, so far every year the NRMP algorithm has been 
able to find a stable matching. A theoretical justification for 
this (in a reasonable probabilistic model), and a study of in- 
centive properties in mechanisms with couples are interesting 
open directions for future research. Finally, it would be in- 
teresting to find other examples where one can prove that in a 
probabilistic setting, truthfulness is probably the best strategy. 
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A Number of singles revisited 

In this paper, we presented an analysis of the expected 
number of players that remain single in a stable marriage 
mechanism, and used this lemma to prove the main result 
of the paper. However, analyzing the expected number of 
singles in a probabilistic setting is of independent interest. 
In this appendix, we present a tighter analysis of the expected 
number of singles when men have random preference lists of 
size k, and women have random complete preference lists. If, 
in addition to the results of this appendix, one can prove that 
the number of singles is concentrated around its expectation, 
then the bound (e k + k 2 )/n proven in this paper would be 
improved. 

LEMMA A. 1 . Consider a collection of n men and n women, 
each man having a random ordering ofk random women, and 
each woman having a random ordering of men. Let Pk(n) 
denote the probability that in a stable matching with respect 
to these preference lists a fixed man remains single. Then 
Pk(n) > ^. 

In order to prove the above lemma, we first generalize 
the scenario defined in the lemma to a case where there 
are m men and n women (m < n). Let pk(m,n) denote 
the probability that a fixed man remains unmarried in this 
scenario. Therefore, Pk(n) = Pk( n , n )- We start by proving 
that if the population of women remains constant, an increase 
in the number of men can only make it harder for a man to 
find a stable wife. 
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Lemma A. 2. For every k,n,m\, and mi, if mi < m 2 then 
p k (mi,n) <Pk(m 2 ,n). 

Proof. It is sufficient to prove that for every k, n, and to, 
Pk{m, n) < Pkijn + 1, n). Consider a fixed man, Bert, in the 
scenario where there are to + 1 men. We want to compute the 
probability that after running the men-proposing algorithm, 
Bert remains single. By Theorem B we know that the order 
of proposals does not affect the outcome of the algorithm. 
Therefore, we can assume that one of the m + 1 men, say 
Ernie, starts proposing to women only after everyone else 
is done with his or her proposals. Therefore, by definition, 
before Ernie starts proposing, the probability that Bert is 
single is precisely Pk(m, n). If Bert is married at this point, 
then there is a chance that after Ernie starts proposing, he 
becomes single, since his wife might leave him and he might 
not have anyone else in his list. However, if he is single before 
Ernie starts proposing, there is no chance that he gets married 
as a result of Ernie's proposal. Therefore, the probability that 
Bert remains single is not less than Pk(m, n). 

Proof of Lemma A.l. Let c < 1 be a constant that will be 
fixed later. By Lemma A. 2, we have pk(n) = pk(n,n) > 
Pk(\cn\ + l,n). Therefore, we only need to prove that 
Pk(\cn] + l,n) > -^k- The proof of this fact is based on 
the following two inequalities: 



(1.7) P k(\cn\ +l,n) > (^{1 - p k {\cn\,n)i 



(1-8) 



p k {\cn\,n) < c k 



We start by proving inequality (1.8). Consider the 
situation where there are \cn~\ men and n women. Fix a 
man, say Kermit. The probability that Kermit remains single 
is pk(\cn\ , n). Now, consider the men-proposing algorithm. 
Since the order of proposals does not change the outcome, 
we can assume that Kermit will wait until everyone else stops 
proposing, and then he will make his first proposal. At this 
moment, there are at most \cn\ — 1 < en women who are 
married, and at least (1 — c)n women who are still single. Let 
S denote the set of single women at this point. Since Kermit's 
list consists of k randomly chosen women, the probability that 
none of them is in S is at most c k . We claim that if at least 
one of the women in Kermit's list is in S then Kermit will 
find a wife. The reason is that every time Kermit makes a 
proposal, if he proposes to a single woman, the proposal will 
be accepted and the algorithm ends. But if he proposes to a 
married woman, he might start a chain of proposals that will 
either end at a single woman, in which case Kermit ends up 
married, or gets back to Kermit, in which case the set of single 
women does not change and we can repeat the same argument 
for the next proposal of Kermit until he reaches a woman in 
her list that is in S. By this claim, the probability that Kermit 
remains single is upper bounded by the probability that none 
of the women in her list is in S, which is at most c k . 



Now, we prove inequality (1.7). We use a similar 
argument: consider a situation where there are \cn~\ + 1 
men and n women, and fix a man, say Oscar. We prove 
that the probability that Oscar remains single is at least 

/ \ k 

(| (1 — pk(\cn\,n))j . To prove this, consider the men- 
proposing algorithm, and let everyone other than Oscar make 
proposals. After they finish proposing, Oscar enters and 
proposes to the first woman in his list. Let M denote the 
set of married women at this point. The expected size of 
M, is the same as the expected number of married men, 
which is (1 — Pk(\cn\ , n)) \cn~\ by the definition of Pk{rn 1 n). 
Therefore, there is a probability of 1^1(1 — p k {\cn\ , n)) > 
c(l — pk(\cn\ , n)) that Oscar proposes to a married woman. 
A married woman rejects a new proposal with probability 
at least 1/2. Thus, there is a probability of at least | (1 — 
Pk(\cn\,n)) that Oscar's marriage proposal is rejected by 
her favorite woman. Similarly, the same can happen for 
her second favorite woman, and so on. Therefore, there is 
a probability of at least (f (1 — pk(\cn],n))) that Oscar 
faces rejection immediately after each of his proposals, in 
which case, he ends up single. 

Inequalities 1.7 and 1.8 imply the following 

Pk{\cn\+l,n)>(^{l-c k )) k . 

Choosing c = k~ x l k in the above inequality completes 
the proof of the lemma. □ 



